NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

NeuralFeels with neural fields: Visuotactile perception for in-hand manipulation

https://doi.org/10.1126/scirobotics.adl0628

Suresh, Sudharshan; Qi, Haozhi; Wu, Tingfan; Fan, Taosha; Pineda, Luis; Lambeta, Mike; Malik, Jitendra; Kalakrishnan, Mrinal; Calandra, Roberto; Kaess, Michael; et al (November 2024, Science Robotics)
Yashinski, Melisa (Ed.)
To achieve human-level dexterity, robots must infer spatial awareness from multimodal sensing to reason over contact interactions. During in-hand manipulation of novel objects, such spatial awareness involves estimating the object’s pose and shape. The status quo for in-hand perception primarily uses vision and is restricted to tracking a priori known objects. Moreover, visual occlusion of objects in hand is imminent during manipulation, preventing current systems from pushing beyond tasks without occlusion. We combined vision and touch sensing on a multifingered hand to estimate an object’s pose and shape during in-hand manipulation. Our method, NeuralFeels, encodes object geometry by learning a neural field online and jointly tracks it by optimizing a pose graph problem. We studied multimodal in-hand perception in simulation and the real world, interacting with different objects via a proprioception-driven policy. Our experiments showed final reconstructionFscores of 81% and average pose drifts of 4.7 millimeters, which was further reduced to 2.3 millimeters with known object models. In addition, we observed that, under heavy visual occlusion, we could achieve improvements in tracking up to 94% compared with vision-only methods. Our results demonstrate that touch, at the very least, refines and, at the very best, disambiguates visual estimates during in-hand manipulation. We release our evaluation dataset of 70 experiments, FeelSight, as a step toward benchmarking in this domain. Our neural representation driven by multimodal sensing can serve as a perception backbone toward advancing robot dexterity.
more » « less
Full Text Available
ShapeMap 3-D: Efficient shape mapping through dense touch and vision

https://doi.org/10.1109/ICRA46639.2022.9812040

Suresh, Sudharshan; Si, Zilin; Mangelson, Joshua G.; Yuan, Wenzhen; Kaess, Michael (May 2022, IEEE International Conference on Robotics and Automation)

Knowledge of 3-D object shape is of great importance to robot manipulation tasks, but may not be readily available in unstructured environments. While vision is often occluded during robot-object interaction, high-resolution tactile sensors can give a dense local perspective of the object. However, tactile sensors have limited sensing area and the shape representation must faithfully approximate non-contact areas. In addition, a key challenge is efficiently incorporating these dense tactile measurements into a 3-D mapping framework. In this work, we propose an incremental shape mapping method using a GelSight tactile sensor and a depth camera. Local shape is recovered from tactile images via a learned model trained in simulation. Through efficient inference on a spatial factor graph informed by a Gaussian process, we build an implicit surface representation of the object. We demonstrate visuo-tactile mapping in both simulated and real-world experiments, to incrementally build 3-D reconstructions of household objects.
more » « less
Full Text Available
Tactile SLAM: Real-time inference of shape and pose from planar pushing

https://doi.org/10.1109/ICRA48506.2021.9562060

Suresh, Sudharshan; Bauza, Maria; Yu, Kuan-Ting; Mangelson, Joshua G.; Rodriguez, Alberto; Kaess, Michael (May 2021, IEEE International Conference on Robotics and Automation (ICRA))

Full Text Available
ARAS: Ambiguity-aware Robust Active SLAM based on Multi-hypothesis State and Map Estimations

https://doi.org/10.1109/IROS45743.2020.9341384

Hsiao, Ming; Mangelson, Joshua G.; Suresh, Sudharshan; Debrunner, Christian; Kaess, Michael (October 2020, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Full Text Available

Search for: All records